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What Is Claimed is* 

1 . A program conversion apparatus for converting a ffi ven 
source projjram into a program for a mult i- thread processor 
including a plurality of program counters and a plurality of 
thread execution apparatus , said plurality of thread execution 
apparatus being operable to fetch , decode and execute a plural i ty 
uf instructions of threads simultaneously m accordance with 
said plurality of program counters such "hat it is possibtp 
tfo execute, after a 'hreari : s created, zhe rnreaa m a control 
speculative mode wherein a change Having had an effect on a 
register set can be cancelled later and to execute the thread 
in a data -dependent speculative mode wherein, when, after a 
self thread loads a value from a memory location, a parent thread 
by which the self thread has been created stores a value into 
the same memory location, at least a processing result of the 
self thread after the load is abandoned and the processing is 
re-executed, said nni i ti-thread processor having an instruction 
set with which it can be executed by a single machine instruct! nn 
or a combination of several machine instructions for a thread 
being executed by any of said thread execution apparatus tn 
create a new thread of the control speculative mode, to end, 
if a designated condition is satisfied, the self thread and 
clear the control speculative mode of a thread of the control 
speculative mode created by the self thread, to abandon thp 
created ihread of the control speculative mode, to give, ^hen 
a thread created by the self thread performs load from a memory 
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location of a designated address, an instruction in advance 
to temporarily block the operation, to clear the load temporary 
blocking instruct inn to the designated memory address , for the 
thread being executed by the thread execution apparatus to create 
a new thread of the data-dependent speculative mode and to clear 
the data-dependent speculative mode of the thread of the 
data- dependent speculative mode created by the self thread, 
said program conversion apparatus comprising: 

a register al location trial section for *rymg register 
allocation prior zo paraileiizauitm lu estimate a register 
allocation situation or variables and intermediate terms of 
an intermediate program; 

a fork spot determination section for determining based 
on a result of the register allocation trial by said register 
allocation trial section whether or not a conditional branch 
portion of the intermediate program shuuld be converted into 
a parallel code for whi ch a thread creation instruction is used 
and determining a parallelization execution method with the 
parallel code; 

an instruction reordering section for converting the 
conditional branch portion in the intermediate program into 
a parallel code for which the thread creation instruction is 
used based on a result of the determination by said fork spot 
determination sect ion and referring to the result of the register 
allocation trial to insert an instruction for assuring a 
data -dependence relationship between threads through a memory 
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into positions before and after the thread creation instruction 
and reorder the instructions before and after the thread creation 
instruction so that thread creation may be performed in an early 
stage; and 

5 a register allocation section for performing definite 

register allocation so that , regarding whether or not a physical 
register is allocated to the parallelized and reordered 
instruction sequence, zaz same aiiocauxun result as *hat upon 
the register allocation trial may be obtained. 

10 2. A program conversion apparatus as claimed in claim 

1, wherein said fork spot determination section investigates 
a data dependence relationship uhruugh a memory from a basic 
block in the intermediate program which is a processing object 
at present to each of basic blocks of branching destinations 

15 of a conditional branching instruction positioned at the tail 
end of the basic block, counts, fur each of the branching 
destinations, the instruction step number, from the top of the 
branching destination basic block, of the instruction at the 
top one of memory reference instructions in the branching 

20 destination basic block which cause the data dependence, and 
selects that one of the nranching destination basic blocks whose 
instruction 3tep number is greater as a new thread to be executed 
parallelly . 

3. A program conversion apparatus as claimed in claim 
25 2, wherein said fork spot determination section determines the 
position of a data -dependent instruction through a memory in 



- 65 



each branching destination basic block using a value obtained 
by accumulating estimated sxeeuuion cycle numbers of 
instructions in plac* of the instruction step number . 

4. A program conversion apparatus as claimed in claim 
5 1 , wherein, upon conversion from a source program into a target 
program first, address coordination information for 
establishing coordination between the basic blocks of the 
intermediate program in said program conversion apparatus and 
machine language addresses ox the target program to oc outputted 

10 is outputted together with -he var^e* program, and i :arge~ 
program execution apparatus reads in the target program and 
the address coordination information and executes the target 
program and then outputs profile information including branch 
profile information between basic blocks upon the execution 

15 of the target program and data dependence information occurring 
through a memory between the basic blocks, whereafter, when 
said program conversion apparatus parallelizes the source 
program to convert the source prugram into a target program, 
said fork spot determination section refers to the profile 

20 information to preferentially select a branching destination 
basic block to which control flows in a high probability at 
a conditional branch and another branching destination "nasi c. 
block with whir.h data dependence occurs in a low probability 
at a conditional branch as anew thread to be executed paral 1 el 1 y . 

25 5. A program conversion apparatus as claimed m claim 

4, wherein said fork spot determination section produces an 
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instruction to cause a conditional branching destination basic 
block selected as an execution si art point of the new thread 
to be executed para! 1 eiiy to temporarily block, when the number 
of spots of different meaory addresses which cause date 
5 dependence is smal ler than a predetermined number based on a 
result of an analysis of data dependence through a memory m 
the intermediate program and a data dependence occurrence 
probability obtained from -he urofiie information, "oari 
uperation of the new thread from rhe memory addresses, but 

10 investigates, when the number jf spots of different memory 
addresses which cause data dependence is equal to or greater 
than thepredcterminednumber , whether or not the data dependence 
occurrence probability is lower than a predetermined 
probability and produces, if the probability is lower, an 

15 instruction to create a new thread in the data -dependent 
speculative mode but controls, if the probability is equal to 
or higher than the predetermined probability, so as to stop 
the parallclization conversion at Ihe spot. 

6. A program conversion apparatus ae claimed in claim 

20 4, wherein said fork spot determination section investigates 
a data dependence relationship through a memory from the basic 
block in the intermediate program currently which is a processing 
object at present, to each of the branching destination basic 
blocks of the conditional branching instruction positioned at 

25 the tail end of the basic block and synthesizes the investigated 
data dependence relationship and the conditional branching 
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probability obtained from the profile information, and if a 
result of the synthesis reveals that the branching probabilities 
regarding the branching destination basic blocks at the 
conditional branch do not have a difference greater than a 
predetermined amount and data dependence occurrence timings 
through a memory do not have a difference greater than a 
predetermined amount, said fnrfc spot determination section 
determines so as not to parallelize the conditional branching 
portion. 

a program conversion apparatus, comprx&iiiK : 
a syntax analysis section for analyzing the syntax of 

a source program to produce an intermediate program: 

a parallelization section for performing optimization 

processing including parallelization for lay intermediate 

program; and 

a code generation section for producing a target program 
including an instruction code tor a target processor apparatus 
from the intermediate program optimized by said parallelization 
section; 

said parallelization section including an intermediate 
program inputting section for reading in the intermediate 
program and analyzing a control flow and a data flow, a register 
allocation section for trying to perform register allocation 
prior to parallelization to estimate a register allocation 
situation of variables and intermediate terms of the 
intermediate program and executing allocation of registers. 
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a fork spot determination action for determining, based on 
a result of the trial of the register allocation, a spot of 
a conditional branch portion of the intermediate program to 
be converted into a parallel code for which a thread creation 
5 instruction is used, an instruction reordering section for 
performing reordering of instructions before and after the 
parallel! zat ion spot from information of the parallelization 
spot determined by 3aid fork spot determination section, Zhe 
data flow and 30 forth, ^no an intermediate program outputting 

10 section for outputting the instruction sequence for which the 
conversion including the psraiieiization has been completed 
in a format of ^hc intermediate program again. 

3. A program conversion apparatus as claimed m claim 
7, wherein said parallelization section includes a profile 

15 information inputting sent i on for receivingprofile information 
outputted from the target processor apparatus as a result of 
execution of the target program and converting the profile 
information into information of an internal formal, and said 
fork spot determination section determines, based on the result 

20 of the register allocation trial and the profile information, 
a spot of a conditional branch portion of the intermediate program 
to be converted into a parallel code in which the thread 
generation code is used anrJ determines a parallelization 
execution method by the parallel code. 

25 9. A program conversion apparatus as claimed in claim 

7. wherein 3aid instruction reordering seccion converts 



- 69 - 



conditional branching port ions in the intermediate program into 
a parallel code in which the thread citation instruction is 
used based on a result of the determination by said fork spot 
determination section, refers to the result uf the register 
allocation trial to insert an instruction for assuring a data 
dependence relationship between threads lluuugh a memory into 
positions before and aft&r the thread creation instruction and 
reorders the instructions before and after the thread creation 
instruction so that thread creation may be performed m an early 
stage. 

10. A program conversion apparatus as claimed in claim 
7 j wherein said register allocation section performs definite 
register allocation snr,nat ? regarding whether or not a physical 
register is allocated to the parallelized and reordered 
instruction sequence, the same allocation result as that upon 
the register allocation trial may be obtained. 

11. A program conversion apparatus for performing 
optimization processing including parallelization of an 
intermediate program obtained by a syntax analysis of a source 
program performed by a syntax analysis section so that the 
intermediate program may be suitable for a target processor 
apparatus , compr i s ing : 

register allocation trial means for trying allocation 
of registers of the target processor apparatus on the 
intermediate program and obtaining register allocation 
information prior to actual allocation; 
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means for calculating a distance of data dependence 
generated through a memory in the target processor apparatus 
for the intermediate program; 

means for determining a fork designation taking the 
S distance of data dependence through a memory on the intermediate 
program into consideration and replacing a conditional branch 
with a thread creation instruction; and 

means for referring to a result of the register allocation 
trial to reorder the instructions before ana after the thread 
10 creation instruction on the intermediate program. 

12, A program conversion apparatus according to claim 
11 , wherein said means for replacing a conditional branch with 
a thread creation instruction ^nc tildes: 

means for calculating a minimum value of data dependence 
15 of intermediate terms and vari ahl es for each of the two branching 
destinations of the conditional branch; and 

means for comparing the two minimum va lues of the distance 
of data dependence determined for the two branches of the 
conditional branch, determining, when the two minimum values 
20 have a difference greater than a predetermined value, the 
branching direction of the branch which exhibits the higher 
minimum value of the distance of data dependence as a fork 
destination and selecting the condi tionai branch spot as a fork 
spot, but determining, when the two minimum values of the distance 
25 of data dependence do not have a difference equal to or greater 
than the predetermined value , that branching destination which 
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lias been a brandling destination in tftfi original intermediate 
program as a fork destination and selecting the conditional 
branch spot as a fork spot candidate. 

13. A program conversion apparatus as claimed in claim 
11, further comprising: 

means for receiving profile information outputted froia 
a processor apparatus which executes the target; program 
outputted from said program conversion apparatus and 
calculating * conditional branching probability *nci a aata 
dependence occurrence frequency from the profile information; 
and 

means for determining a fork destination and a data 
dependence assurance system from the distance of ctata dependence , 
the conditional branch probability and the data dependence 
occurrence frequency , and the number of spots of different memory 
addresses which cause data dependence and replacing the 
coudi Clonal branch with the thread creation instruction. 

14 . A program conversion apparatus as claimed in claim 
7, wherein the target processor apparatus is a roulti- thread 
processor which includes a plurality of program counters and 
a plurality of thread execution apparatus, said plurality of 
thread execution apparatus being operable to fetch, decode and 
execute a plurality of instructions of threads simultaneously 
in accordance with said plurality of program counters such that 
il is possible to execute, after a thread is created, the thread 
in a control speculative mode therein a change having had an 
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effect on a register set can be canceled later and to execute 
the thread in a data -dependent speculative mode wherein, when , 
after a self thread loads a value from a memory location, a 
parent thread by which the self thread has been created stores; 
a value into the same memory location , at least a processing 
result of the self thread after the load is- abandoned and the 
processing is re-executed said multi-thread processor havxng 
an instruction set with which jan oe executed by a single 
machine Instruction or a combination of several machine 
instructions for a thread being scouted by any of said thread 
execution apparatus to create a new thread of the control 
speculative mode, to end, if a designated condition is satisfied . 
Lhe self thread and nlear the control speculative mode of a 
thread of the control speculative mode created by the self thread . 
to abandon the created thread of the control speculative mode, 
to give, when a thread created by the self thread performs load 
from a memory location of a designated address , an instruction 
in advance to temporarily block the operation, to clear the 
load temporary blocking instruction to the designated memory 
address , for the thread being executed by the thread executi on 
apparatus to create a new thread of the data -dependent 
speculative mode and to clear the data-dependent speculative 
mode of the thread of the data-dependent speculative mode created 
by the self thread. 

15, A program conversion method for performing an 
optimization process including uaraiieiization for *n 
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intermediate program ontpntted as a result of a syntax analysis 
on a program conversion apparatus which compiles a source program 
and outputs a target program for a target processing apparatus 
of the multi- thread type, comprising; 

a register allocation trial step of trying register 
allocation prior to parallelization to estimate a register 
allocation situation of vanabl.es and intermediate terms of 
the intermediate program; 

a fork spot determination step 01 determining based on 
a result of the register allocation ;rial *heiher or not a 
conditional branch portion of the intermediate program should 
be converted into a parallel code for whicu a thread creation 
instruction is used or performing determination of whether or 
not the conditional branch portion should be cuaverLed into 
a parallel code and, when surh conversion should be performed, 
determination of a parallelization execution method; 

an instruction reordering step of converting the 
conditional branch portion in the intermediate program into 
a parallel code for which the thread creation instruction is 
used based on a result of the determination by the fork spot 
determination step and referring to the result of the register 
allocation trial to insert an instruction for assuring a 
data -dependence relationship between threads through a memory 
into positions before and after the thread creation instruction 
and reorder the instruct i ons before and after the thread creation 
instruction so that thread creation may be performed in an early 
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stage; and 

a register allocation step of performing definite 
register allocation so that the same allocation result as that 
upon the register allocation trial may be obtained for the 
5 parallelized and reordered instruction sequence. 

16. A program conversion method as claimed in claim 15, 
wherein the fork spot determination 3tep means calculates * 
minimum value of data dependence of intermediate terms and 
variables for each of the two oranchmg destinations nf me 

10 conditional branch, compares the two minimum -'a±ue3 ot trie 
distance of data dependence determined for the two branches 
of the conditional branch, determines, when the two minimum 
values have a difference greater than a predetermined value, 
the branching direction of the branch which exhibits the higher 

15 minimum value of the distance of data dependence as a fork 
destination and selects the conditional branch spot aa a fork 
spot, but determines, when the two minimum values of the distance 
of data dependence do not have a difference equal to or greater 
than the predetermined value, that branching destination which 

20 has been a branching destination in the original intermediate 
program as a fork destination and selects the conditi onal branch 
spot as a fork spot candidate. 

17. A program conversion method as claimed in claim 15, 
wherein the fork spot determination step investigates a data 

25 dependence relationship through a memory from a basic block 
having no branch and no confluence from within the intermediate 
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program which is a processing object at present to each of basic 
blocks of branching destinations of a conditional branching 
instructionpositionedatthetai i endof thebasicblock, counts, 
for each of the branching destinations, the instruction step 
mimhpr from the top of the branching destination basic block, 
of the instruction at the top one of memory reference instructions 
in the branching destination basic block which cause the data 
dependence, and selects that one of the branching destination 
basic blocks whose instruction step number is greater as a new 
thread to be executed paraiielly 

18 . A program conversion method as claimed in claim 17 . 
wherein the fork spot determination step determines the position 
of a data -dependent instruction through a memory in each 
branching destination basic block using a value obtained by 
accumulating estimated execution cycle numbers of instructions 
in place of the instruction step number. 

19, A program conversion method as claimed in claim 15, 
wherein, upon conversion from a source program inlo a target 
program first by said program conversion apparatus, address 
coordination information for establishing coordination between 
the basic blocks of the intermediate program andmachine language 
addresses of the target program to be uulputted is outputted 
together with the target program, and a processor apparatus 
which is to execute the ob j ect program reads in the target program 
and the address coordination information and executes the target 
program and then outputs profile iulunuation including branch 
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profile information between basic blocks upon the execution 
of the target program and data dependence iaf urination occurring 
through a memory between the basic blocks, whereafter, when 
said program conversion apparatus parallelizes the source 
5 program to convert the source program into a target program, 
the fork spot determination step refers to the profile 
information to preferentia * iy select a branching destination 
basic block zo vmicn control flows m a jrooaui-ity at 

a conditional oranch and another ©ranching destination basic 

10 block with which data dependence occurs m a _uw urooaoiiity 
at a conditional branch as a new thread to be executed paralleliy . 

20. a program conversion method as claimed in claim 19. 
wherein the fork spot determination step produces an instruction 
to cause a conditional branching destination basic block 

15 selected as an execution start point of the new thread to be 
executed paralleliy to temporarily block, when the number of 
spots of different memory addresses which cause date dependence 
is smaller than a predetermined number based on a result of 
an analysis of data dependence through a memory in the 

20 intermediate program and a data dependence occurrence 
probability obtained from the profile information, load 
operation of the new thread from the memory addresses, but 
investigates, when th« number of spots of different memory 
addresses which- cause data dependence is equal to or greater 

25 than the predetermined number , whether or not the data dependence 
occurrence probability is lower than <± ^redetermined 
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probability and produces, if the probability is lower, an 
instruction to create a new thread in the data-dependent 
speculative mode but controls, if the probability is equal tn 
or higher than the predetermined probability, 30 as to atop 
5 the parallelization conversion at the spot, 

21 . A program conversion apparatus as claimed in claim 
19, wherein the fork spot determination step investigates a 
data dependence relationship through a memory from the basic 
block iu the intermediate program currently which is aprncessmg 

10 object at present zo each of the branching destination basic 
blocks of uhe conditional branching instruction positioned at. 
the tail end of the basic block and synthesizes the investigated 
data dependence relationship and the conditional branching 
probability obtained from the profile information, and if a 

15 result of the synthesis reveals that the branching probabi 1 i ti es 
regarding the branching destination basic blocks at the 
conditional branch do not have a difference greater than a 
predetermined amount and data dependence occurrence timings 
through a memory do not have a difference greater than a 

20 predetermined amount > said fork spot determination section 
deieriuines so as not to parallelize the conditional branching 
portion. 

22. A program conversion method as claimed in claim is, 
wherein the fork spot determination step includes the steps 

25 of: 

discriminating whether or not the conditional branching 
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instruction corresponds to a return branch of a loop structure 
in the intermediate program; 

determining, when tile conditional branch ing instruction 
corresponds to a loop return branch, the direction of the return 
5 branch, which is a loop continuing direction, as a fork 
destination and selecting the conditional branch spot as a fork 
spot . 

but calculating, when the conditional branching 
lusirucu^un not a loop return branch, a minimum va±ue ox 
10 the distance of data dependence of intermediate terms / variables 
fur each of the two branch dest inat ions of the conditional branch ; 

comparing the two minimum values of the distance of data 
dependence determined with regard to the two branches of the 
conditional branch with each other to discriminate whether or 
15 not Ike two minimum values have a difference greater than a 
predetermined value; and 

determining, when the two minimum values of the distance 
of data dependence have a difference greater than the 
predetermined value, the branch which exhibits a Larger one 
20 of the minimum values of the distance of data dependence as 
a fork destination and selecting the conditional branching spot- 
as a fork spot; but 

determining, when the two minimum va Sues of the distance 
of data dependence do not have a difference greater than the 
25 predetermined value, that one of the branches which Has been 
a branch destination in the original intermediate program as 
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a fork destination and selecting tlie conditional branching spot 
as a fork candidate. 

23. A program conversion method as claimed in claim 22, 
wherein the distance of data dependence is the number of steps 
5 in Lhe intermediate program which represents at what distance 
from the top of a basic program of the branching destination 
lilts memory reference instruction is positioned with regard tn 
each of the intermediate terms and variables which are defined 
iu Lhe basic block which is a processing object at present ana 

10 may possibly be referred zo in the branching destination and 
besides are estimated to be arranged on the memory. 

24 . A program conversion method as claimed in claim 22 , 
wherein, upon the determination of the distance of data 
dependence, the number of cycles estimated to be required when 

15 pertaining instructions are executed on a processor of an obj ect 
architecture . 

25. A program conversion method as claimed in claim 15, 
wherein the instruction reordering step includes : 

a first step of investigating an allocation situation 
20 of registers with regard to whether each of intermediate terms 
and variables in the intermediate program is coordinated with 
a register or a memory; 

a second step of replacing a branching instruction 
positioned at the tail end of a basic instruction which is a 
25 processing object at present with a control speculation mode 
PORK instruction while the fork destination which is an operand 
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of the control speculation morte KQKK instruction is determined 
as the fork destination selected by the fork spot determination 
step; 

a third step of moving a branching condition expression 
positioned inmtediately prior to the control speculation mode 
FOBS instruction in the intermediate instruction to the uusilion 
immediately next to the control speculation mode FORK 
instruction and inserting to the tail end of the oaaic block, 
which is che destination of ~ne movement of che oranchmg 
condition expression, an instruction 3equence for ending, *hen 
the branching condition is sari sf i ed . the self thread and placing 
a child thread into a settlement; mode ^rhich is a auu- control 
speculation mode , £ut abandoning, when the branching condition 
is not satisfied, the child thread and keeping the self thread 
to continue execution of a succeeding instruction train; 

a fourth step of moving each of instruction statements 
which are on the upstream side with respect to the control 
speculation mode FORK instruction in the basic block being a 
processing object at present and are to be substituted into 
intermediate terms and variables coordinated with a memory to 
a position on the downstream side with respect to the control 
speculation PORK instruction and inserting a BLOCK setting 
instruction to the position immediately prior to the control 
speculation mode FORK instruction while inserting a BLOCK clear 
instruction to the position immediately next to the movement 
destination of the substitute statement; and 
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a fifth step of issuing an instruction to allocate the 
registers in accordance with the register allocation situation 
assumed by the fork conversion processing in the second step. 

26 . A program conversion method as claimed in claim 15 , 
wherein the fork spot determination step includes: 

a first step of discriminating whether ur not the 
conditional branching instruction corresponds to a return 
branch of a loop structure in the intermediate program; 

a second step if -irnvi sionaiiy determining, when the 
conditional branching instruction corresponds zo a return 
branch of a loop structure, the direction of the return branch 
as a fork destination: 

a third step of calculating, based on the received profile 
information, a probability with which a taken side of the 
conditional branching instruction is selected and another 
probability with which a fall-through side of Ike conditional 
branching instruction is selected; 

a fourth step of discriminating whether or not the two 
calculated probabilities of the branches have a difference 
greater than a predetermined value; 

a fifth step of provisionally determining, if the 
difference between the two probabilities of Ike branches is 
greater than the predetermined value , the branch which exhibits 
the higher probability as a fork destination; 

a sixth Step of calcul ating a minimum value of the distance 
of data dependence for each of the two branching destinations 
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of the conditional branch; 

a seventh step of comparing the two minimum values of 
the distance of data dependence determined with regard to the 
two branches of the conditional branch with each other to 
5 discriminate whether or not the two minimum values have a 
difference greater than a predetermined value; 

an eighth step of determining, when the two minimum values 
of the distance of data dependence have a difference greater 
Lhaii uhe predetermined value or no data dependence is r'ouna. 
10 that one of che branches which exhibits a Higher one of the 
minimum values of the distance of data dependence as a fork 
destination: 

a ninth step of calculating a mini mum va me of the distance 
of data dependence for the fork destination provisionally 

IS determined in the second step or the fifth step and di sori mi nating 
whether or not the minimum value of the distance of data 
dependence of the provisionally determined fork destination 
is equal to or higher than a predetermined value; 

a tenth step of settling, if the minimum value of the 

20 distance of data dependence of the provisionally determined 
fork destination is equal to or higher than a predetermined 
value or no data dependence through a memory is found, the fork 
destination provisionally determined in the second step or the 
fifth step as a- formal fork destination; 

25 an eleventh step of excepting, when it i$ determined that 

the minimum value of distance of data dependence is lower than 
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the predetermined value, the basic block from a fork spot; 

a twelfth step of calculating a data dependence occurrence 
frequency from the received profile information; 

a thirteenth step of discriminating whether or not the 
data dependence occurrence frequency is equal to or higher than 
a fixed level, counting, when the data dependence occurrence 
frequency is lower than the fixed level, the number of 
intermediate terms/ variables on the memory which may possibly 
cause data dependence from ^he basic olock of the forte source 
to the basic block of the fork destination in the intermediate 
program, discriminating whether or not the emmt value 
representing a data -dependent spot number is equal to or higher 
than a fixed level, determining that a fork according zr\9 DSP 
system is used if the data -dependent spot number is equal to 
or higher than the fixed level but determining that a fork 
according to the BLOCK system is used if the data-dependent 
spot number is lower than the fixed level, and providing 
information of the fork to the FORK instruction in the 
intermediate program; and 

a fourteenth step of counting, when the data dependence 
occurrence frequency is equal to or higher than the f i x«rt i evei , 
the number of data -dependent variables on the memory and 
determining that a fork according to the BLOCK system is used 
if the counted number is smaller than a fixed level but removing 
the basic block from a fork candidate if the counted number 
is equal to or greater than the fixed level 
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27. A program conversion method as claimed in claim 15. 
wherein the instruction reordering step includes: 

a first step of investigating an allocation situation 
of registers with regard to whether each of intermediate terms 
and variables in the intermediate program is coordinated with 
a register or a memory; 

a second step of replacing a branching instruction 
positioned at the tail and of a, basic instruction which <x 
processing object at present wi-h ^ "octroi speculation mode 
PORK instruction while the fork destination which is an operand 
of the control speculation mode FORK instruction is determined 
as the fork destination selected by the foi*k spot determination 
step; 

a third step of moving a branching condition expression 
positioned immediately prior to the control speculation FORK 
instruction in the intermediate instruction to Che position 
immediately next to the control speculation FORK instruction 
and inserting to the tail end of the basic block, which is the 
destination of the movement of the branching condition 
expression, an instruction sequence for ending, when the 
branching condition is satisfied, the self thread and placing 
a child thread into a settlement mode which is a non-control 
speculation mode , but abandoning, when the branching condition 
is not satisfied, the child thread and keeping the self thread 
to continue execution of a succeeding instruction train; 

a forth 3tep of checking whether a Turk data assurance 
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systeiuuf Hie fork spot determined by the fork spot determination 
step is the BLOCK system or the DSP system; 

it fifth step of moving , when the fork data assurance system 
is the BLOCK systom, a memory store statement prior to the fork 
5 to a position after the fork, inserting necessary BLOCK setting 
and BLOCK clear instructions, inspecting, upon the movement, 
d data dependence relationship and moving only those 
instructions a change of whose instruction execution order does 
not change d result of arithmetic operation, 

10 a sixth step of modifying, when the fork data assurance 

system ia lire DSP system, the FORK instruction produced by 
replacement in the second step so that a substitute statement 
into an intermediate term coordinated with a memory i a mnvpci 
to a position next to the PORK instruction to perform the fork 

15 in a data-dependent speculation mode; and 

a seventh step of issuing an instruction to allocate the 
registers in accordance with the register allocation situation 
assumed by the fork conversion process in the second step. 
28. A recording medium on which a program for causing 

20 a computer to perform an optimization process including 
pax-allelization for an intermediate program outputted as a 
result of a syntax analysis on a compiler which compiles a source 
program and produces and outputs a target program for a 
multi-thread processor apparatus is recorded, the optimization 

25 process including: 

a register allocation trial process of trying register 
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allocation prior to parallel i 7sti on to estimate a register 
allocation situation of variables and intermediate terms of 
the intermediate program; 

a fork spot determination process of determining bused 
5 on a result of the register all onati on trial whether or not 
a conditional branch portion of the intermediate program should 
be converted into a parallel code for winch a thread creation 
instruction is used or performing determination of whether or 
not uie conditional branch portion snould be converted into 

10 a parallel code and, when such conversion should be performed, 
determination of a parallel i zati on execution method: 

an instruction reordering process of converting the 
conditional branch portion in the intermediate program into 
a parallel code for which the thread creation instruction is 

15 used based on a result of the determination by the fork spot 
determination step and referring to the result of the register 
allocation trial to insert an instruction for assuring a 
data-dependent relationship between threads through a memory 
into positions before and after the thread creation instruction 

20 andreorder the instructions before and after the thread creation 
instruction so that thread creati on may be performed in an early 
stage; and 

a register allocation process of performing definite 
register allocation so that the same allocation result as that 
25 upon the register allocation trial with regard to whether a 
physical register is allocated may be obtained Tor -;he 
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parallelized and reordered instruction sequence. 

29. A recording medium as claimed in claim 28, wherein 
the fork spot determination process investigates a data 
dependence relationship through a memory from a basic block 
in the intermediate program which is a processing object at 
present to each of basic blocks of branching destinations of 
a conditional branching instruction positioned at the tai ' *nd 
of the basic block, counts, for each of the branching dear mations . 
the instruction seep number, Trom the :op .of the hrwrnng 
destination basic block, of zhc instruction at the zop one of 
memory reference instructions in the branching destination 
basic block which cause the data dependence, and selects that 
one of the branching destination basic blocks whnsff instruction 
step number is greater as anew thread to be executed parallelly. 

30. A medium as claimed in claim 17, wherein the fnrk 
spot determination process determines the position of a 
data- dependent instruction through a memory in each branching 
destination basic block using a value obtained by accumulating 
estimated execution cycle numbers of instructions in place of 
the instruction process number. 

81. A medium as claimed in claim 28, wherein, upon 
conversion from a source program into a target program first 
by said compiler, address coordinate nn information for 
establishing coordination between the basic blocks of the 
intermediate program and machine language anrir^sses of the 
target program to be outputtcd is outputted together with the 
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target program, and a processor apparatus which is to execute 
the object program reads in the target program and the address 
coordination information and executes the target program and 
then outputs profile information including branch profile 
information between basic blocks upon the exeunt i cm of the target 
program and data dependence information occurring through a 
memory between the basic blocks , whereafter . when said compiler 
parallelizes the source program to convert the source program 
into a target program, she fork spot, determination process refers 
to the profile information to preferentially select a branchiae 
destination basic block to which control flows in a high 
probability at a conditional branch and another branching 
destination basic block with which data dependence occurs m 
a low probability at a conditional branch as a new thread Lu 
be executed parallel ly. 

32. A medium as claimed in claim 31, wherein the fork 
spot determination process produces an instruction to cause 
a conditional branching destination basic block selected as 
an execution start point of the new thread to be executed 
parallclly to temporarily block, when the number of spots of 
different memory addresses which cause date dependence is 
smaller than a predetermined number based on a result of an 
analysis of data dependence through a memory in the intermediate 
program and a data dependence occurrence probability obtained 
from the profi le information, load operation of the new thread 
from the memory addresses, but investigates, when llie number 
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of spots of different memory addresses winch cause rtata 
dependence is equal to or greater than the predetermined number , 
whether ur uot the data dependence occurrence probability is 
lower than a predetermined probability and produces , if the 
probability is lower, an instruction to create a new thread 
in the data- dependent speculative mode but controls, if the 
urubabiiity is equal to or higher than the predetermined 
probability, so as to szop the parailcii2ation conversion at 
the $vul 

33. a compiler as claimed in claim 31, wherein the fork 
spot determination process investigates a data dependence 
relationship through a memory from the basic block m the 
intermediate program currently ^vhieh is a processing object 
at present to each of the branching destination basic blocks 
uf the conditional branching instruction positioned at the tax L 
end of the basic block and synthesizes the investigated data 
dependence relationship and the conditional branching 
probability obtained from the profile information, and if a 
result of the synthesis reveals that thehranchingprobabilitxes 
regarding the branching destination basic blocks at the 
conditional branch do not have a difference greater tiian a 
predetermined amount and data dependence occurrence timings 
through a memory do not have a difference greater than a 
predetermined amount, said fork spot determination section 
determines so as not to parallelize the conditiona* branching 
portion . 



